Zero Pearson Coefficient for Strongly Correlated Growing Trees 
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We obtained Pearson's coefficient of strongiy correiated recursive networks growing by preferential 
attachment of every new vertex by m edges. We found that the Pearson coefficient is exactly zero 
in the infinite network limit for the recursive trees (m =1). If the number of connections of new 
vertices exceeds one (m > 1), then the Pearson coefficient in the infinite networks equals zero only 
when the degree distribution exponent 7 does not exceed 4. We calculated the Pearson coefficient 
for finite networks and observed a slow, power-law like approach to an infinite network limit. Our 
findings indicate that Pearson's coefficient strongly depends on size and details of networks, which 
makes this characteristic virtually useless for quantitative comparison of different networks. 
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The Pearson coefficient r is used as an integral charac- 
teristic of structural correlations in a network. Pearson's 
coefficient characterizes pairwise correlations between de- 
grees of the nearest neighboring vertices in networks. 
Some observable quantities in correlated networks (e.g., 
the size of a giant connected component near the point of 
its emergence) are directly expressed in terms of this co- 
efficient p], 0] ■ The Pearson coefficient is the normalized 
correlation function of the degrees of the nearest neigh- 
bour vertices [1, 0, 0j- The coefficient is the ratio: 



(jfc)e-(fc) c 2 ((J - <fc)c)(fc ~ <fc) )) c 
(fc 2 )c - (fc) 2 



(1) 



where (jk) is the average product of the degrees j and fc 
of the end vertices of an edge, (k) c = (k 2 ) / (k) is the aver- 
age degree of an end vertex of an edge, (k 2 ) c — (k 3 )/(k) 
is the average square of the degree of an end vertex of an 
edge [6], and 



(fc 3 ) (fc 2 ) 



2\2 



(fc) {kf 



(2) 



is for normalization. Here (. . .) and (. . .) c denote aver- 
aging over vertices and edges, respectively [see Eqs. (fl"6|). 
(fT8|) . and (|20|) below]. Pearson's coefficient can be posi- 
tive (in average, assortative mixing of the nearest neigh- 
bors' degrees) or negative (disassortative mixing) and 
takes values in the range from —1 to 1. The Pearson 
coefficient r is a convolution of the joint distribution of 
nearest neighbor degrees, P(j,k) = e.jk- This joint dis- 
tribution is the probability that the ends of a randomly 
chosen edge have degrees j and fc, J2jk e 3 k = 1- Being an 
integral characteristic of degree-degree correlations, the 
Pearson coefficient misses details of these correlations, 
compared to ejk @,H,[3|- Despite this fact, Pearson's co- 
efficient is widely used for characterization and compar- 
ison of real- world networks [4J. Note that the compared 
real networks have different sizes. In this paper we show 
that since r is a markedly size dependent quantity, Pear- 
son's coefficient may be used for comparison of networks 
only with a very critical attitude. We calculate Pearson's 
coefficient for the simplest growing complex networks, 



namely recursive random networks with preferential at- 
tachment of new vertices. We describe the size depen- 
dence of r in these networks, Fig. Q] Remarkably, for all 
infinite recursive trees of this kind, we find that the Pear- 
son coefficient is exactly zero at any value of the degree 
distribution exponent 7, although all these networks are 
strongly correlated. The statement that Pearson's coeffi- 
cient is zero in the range 7 > 4 is essentially non-trivial. 
The point is that zero value of the Pearson coefficient in 
the range 7 < 4 where the third moment of the degree 
distribution diverges is clear. Indeed, in this region, the 
denominator in definition (fT]) diverges in the infinite net- 
work limit [lfj. This is not the case at 7 > 4, and for zero 
r, the numerator in Eq. fl} must be zero. In this range, 
zero value of Pearson's coefficient means a surprising ex- 
act mutual compensation of different degree-degree cor- 




FIG. 1: Pearson's coefficient r vs. degree distribution ex- 
ponent in the recursive networks with m = 1 and 2 for the 
networks of 10 3 (x), 10 4 (*), 10 5 (o), and 10 6 (+) vertices. 
The thick lines show the Pearson coefficient for the infinite 
networks. 
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relations in these trees. There are two opposing kinds of 
degree-degree correlations in complex networks: assorta- 
tive and disassortative. Here assortativity is a tendency 
of high degree vertices to have high degree neighbors and 
low-degree vertices to have low degree neighbors. In con- 
trast, disassortative mixing means neighborhood of ver- 
tices with contrasting (low and high) degrees. This ten- 
dencies may be opposing in different ranges of degrees, 
see discussion in Sec. IIII1 Fig. |4j That is, assortative and 
disassortative mixing may coexist. In particular, this is 
the case for random recursive trees. Our results show 
that in these growing networks, the different kinds of 
correlations completely compensate each other in the in- 
finite network limit. 
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FIG. 2: Variation of exponent z [5r(t) = r(t) — oc t z ] 
with 7 exponent of a degree distribution. 



I. MAIN RESULTS 

We study recursive networks in which each new vertex 
is attached to m existing ones chosen with probability 
proportional to a linear function of vertex degree, k+A = 
k+am. This rule generates scale-free correlated networks 
with a degree distribution exponent 7 = 3 + A/m = 3 + a. 

In the infinite network limit, Pearson's coefficient 
^00(7 < 4) = 0, since the third moment of the degree 
distribution [see the denominator of Pearson's coefficient 
definition {!])] diverges. Here we define r x = r(t—>oo), 
where t in the number of vertices in a network. For 7 > 4, 
we find 



(m-l)(a-l)[2(l+m) + a(l+3m)] 
(l+m)[2(l+m) + a(5+7m) + a 2 (l+7m)] ' 



(3) 



which shows that for m — 1 (i.e., for random recursive 
trees), Pearson's coefficient = for any value of 7. 
One can also see that 7-00(7 = 4) = for any m. In 
particular, for uniformly random attachment, i.e., 7— >oo, 
we have: 



roc (7— >oo) 



(to - l)(l + 3m) 
(1 + to)(1 + 7m)' 



(4) 



Figure[l] obtained by simulation of the networks of 10 3 , 
10 4 , 10 5 , and 10 6 vertices (the number of runs for each 
point was between 50 and 500 for 7 < 3 and between 5 x 
10 3 and 10 4 for 7 > 3), demonstrates the size dependence 
of Pearson's coefficient in these networks. This figure 
shows that even for large networks, the deviations from 
the limiting infinite network values are significant. We 
find the following asymptotic size dependences Sr(t) — 
r(t) - Too: 



5r(t) oc < 



_^-(7-2)/(7-i) ) ari y TOj 7 <3, 

_i-V(7-i) m = l,) 

> 3 < t < 4 
+r( 4- 7 )/( 7 -i) ) m>l I ' (5) 



_ t -(7-3)/(7-l), m= 1, 

I +i-(7-4V(7-i) m > 1 



7 > 4. 
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FIG. 3: The measured dependences of exponent z on 7 for 
recursive trees (m=l). The solid curve is the theoretical de- 
pendence z( 7 , m=l). The data points are obtained by power- 
law fitting of the Pearson coefficient of simulated trees with 
sizes close to the largest simulated size (10 6 vertices). 



Here we show the signs of the asymptotes but ignore their 
factors. The positive sign of the asymptotes at m>l, 
3<7<4 and 7>4 means that r(t) approaches the infinite 
size limit from above. In this situation (m>I, 7>3), 
Pearson's coefficient varies with size non-monotonously: 
first increases and then diminishes to (see the second 
panel of Fig. [!}. Introducing exponent z: 8r(t) oc t~ z for 
large network sizes t, we arrive at the dependences z(j) 
shown in Fig. [51 Note that z < 1, so the infinite network 
limit is approached slowly. The relaxation to the infinite 
network values, r x , is especially slow if exponent 7 is 
close to 2 (at any to) or to 4 (only if m > I). In the 
specific case of to > I, 7 = 4, we obtain the logarithmic 
relaxation: 



6r(t) oc — . 



(0) 



We measured the dependence of exponent z on 7 in the 
simulated networks. The result, shown in Fig. [31 demon- 
strates an agreement with above analytical predictions. 
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II. PEARSON COEFFICIENT OF INFINITE 
NETWORKS 

Let us derive the Pearson coefficient §5§ of the infinite 
recursive networks. The derivation is based on rate equa- 
tions for the average number N k (t) of vertices of degree fc 
in a recursive network at time t and the average number 
Ejkit) of edges connecting vertices of degrees j and fc. 
We have 



k—m 



for vertices, and so 



N k (t)=t Pk (t), 



(7) 



(8) 



where p k {t) = P(k, t) is a degree distribution of the net- 
work of size t. 

For edges, we have: 



and so 



E jk (t) = 2mte jk (t), 



(9) 



(10) 



where ejk is the degree-degree distribution for edges. 
Using the standard rate or master equation approaches 
[ill fl2L [l3| to this kind of networks (specifically, to the 
recursive networks growing due the preferential attach- 
ment mechanism), we write the following rate equations 
for N k {t) and E jk (t): 

N m (t + 1) = N m {t) + 1 - m— — — N m (t), 



m(2 + a)t 



N k>m (t + I) = N k (t) 
k - 1 



m(2 + a)t 



ma , . k + ma , . , 

N k -i{t) - m———N k (t) (11) 



m(2 + a)t 



(note that the mean vertex degree in these networks is 
(k) = 2m) and 



E m k(t + 1) — E m k(t) 
m + ma 



1 



m(2 + a)t 



. k + ma 

Co i \. E mk{t) H pr— — r-E mk (t) 

m{2 + a)t m(2 + a)t 

E jk>m (t+1) = E jk {t) 

k-l + ma j-l + ma ' 

+ a)t m(2 + a)t 



m{2 

j + ma 
m(2 + a)t 



Ejk(t) 



m{2 + a)t 



(12) 



We separately write out equations for the case of k = m. 
Here we used the fact that the probability to attach a 



new vertex to a vertex i of degree fej in this model is 
fej + ma fcj + a fcj + a 



^(t + mo) ((fc) + ma)t m(2 + a)i 



(13) 



Assuming a stationary regime at the limit t — > cxd 
[iVfc(t) = ipfc, Ejkit) = 2mtej k ], we reduce these equa- 
tions to 

m + ma 

p m = 1 - TO—— -p m , 

m(2 + a) 
k — 1 + ma fc + ma 

Pfe>m = TO ___, ; _ N Pfc-i - m — ^— i p k (14) 



m(2 + a) 



m(2 + a) J 



and 



2m(k + m + 2 + 2ma + a)e mk = (fe — 1 + ma)p k -i, 
m(j + fc + 2 + 2ma + a)e jk>m 

= m(fe — 1 + ma)ej. k -\ — m(j — 1 + ma)e J -_i 1 fc. (15) 

To obtain the Pearson coefficient, see definition (JTJ) , we 
must find (fc 2 ), (A: 3 ), and (Jk) e , where 



(jk) c = ^2jke jk . 
jk 



(16) 



Multiplying both the sides of the second equation of the 
system (|14j) by k 2 and fc 3 and summing over k, and taking 
into account 



2 + a 



2 + a + m + ma 



(17) 



which follows from the first equation of the system (| 14[) . 
we obtain 



and 



(fc 2 ) = V fc V = — (2 + 5ma + a + 2m) (18) 
^-^ a 



(fc 3 ) 



a(a — 1) 

x[6(l+m)+a(5+21m+8m 2 )+a 2 (l+9m+16m 2 )]. (19) 
respectively. 

To get (jfc)o, we multiply both the sides of the second 
equation of the system (|15[) by jk and sum them over j 
and fc. We also take into account the following general 
equality: 



(fc)c = ke 3 k = 



jk 



(fc 2 ) 
(fc) ' 



which gives 



(fc) c = —(2 + 5ma + a + 2m). 



(20) 



(21) 
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As a result we find 



{?'*). 



:[2 + 2TO + a(5 + 7m) + a 2 (2 + 7m)], (22) 



and so at a > 0, i.e., 7 > 3, the numerator of definition 
© is 



1 



4a 2 



[4(1 + to) + 4a(l + 2m) + a 2 (l + 3m)]. (23) 



Importantly, this numerator is zero if m = 1. So we see 
that Too (to = 1) = at least if 7 > 3. On the other hand, 
if to > 1, the numerator is finite in the range 7 > 3. 
This shows that r oc (m > 1) must be zero at 3 < 7 < 
4, since the denominator diverges if 7 < 4. Finally, at 
7 < 3, Too = at any m [see the next section, Eq. (|3"3"|) ]. 
Substituting relations (fl8|) , (jT9|) , and (|23|) into definitions 
([1]) and ([2]) readily gives the resulting expression for 
the stationary value of Pearson's coefficient. 
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III. SIZE DEPENDENCE 

Equations (fTTj) and (fT2"|) allow one to derive the full 
size dependence of the Pearson coefficient. Instead of 
these cumbersome straightforward calculations, we ob- 
tain the asymptotic behavior of r(t) in an easier way, us- 
ing known results for the asymptotics of degree-degree 
correlations in these networks 

M EE M- Th e 

derivation is based on the following expression of the 
Pearson coefficient in terms of the average vertex degree 
fcnn(fc) of the nearest neighbors of the vertices of degree 
fc: 

_ (fc)E fc Mfc-l)Pfc[fcnn(fc)-(fc 2 )/(fc)] 

7 (fc)(fc 3 ) -(fc 2 ) 2 1 ' 

= (fc)E fc fcVfcnn(fc)-(fc 2 ) 2 

(fc)(fc 3 ) - (fc 2 ) 2 ' 1 ' 

This expression directly follows from definition (|TJ) . The 
leading asymptotics of fc nn (fc) can be obtained by using 
the known exact asymptotics of riki for these recursive 
networks Here n k \ is the probability that a de- 

scendant vertex of degree k is connected to an ascendant 
vertex of degree I. This quantity satisfies the following 
relations: 

^ n kl = p fc , 
1 

^2(nki + nik) = kp k , 
1 

22 l(n kl + nik) = kp k k nn (k). (26) 
1 

At m = 1, according to Ref. 

n kl - r- 2 ^ 1 " 27 if 1< I < fc, 

n fei - Z 1 -^- 2 if 1< fc < L (27) 



1 — — — — 1 

12 3 4 

10 10 10 10 10 

fc 



FIG. 4: Average degree of the nearest neighbours of a ver- 
tex versus the degree of this vertex for the scale-free recursive 
networks of various sizes with the degree distribution expo- 
nent values 3.1053 and 4, and m = 1. The solid lines show 
the stationary dependences k nn (k) in the infinite networks. 
These curves were find by numerical solving the stationary 
limits of the rate equations. Both these dependences, substi- 
tuted into expressions (|24[) or (|25[1 give exactly zero Pearson 
coefficient. The positive slope of k nn (k) means assortative 
mixing, while the negative slope means disassortative mixing. 
The non-monotonous dependencies indicate the combination 
of these kind of degree-degree correlations, which, in this case, 
exactly compensate each other. The points show the results 
of simulation of the recursive networks of 10 3 (x), 10 4 (*), 
10 5 (o), and 10 6 (+) vertices. Note a very slow convergence 
of the results for finite networks to the infinite network limit 
at 7 close to 3. 



For 7 > 3 and any m, fc nn (fc,t) approaches a stationary 
limit as the network grows. Figure [4] demonstrates this 
relaxation for two values of the degree distribution ex- 
ponent 7, namely, for 7 = 3.1053 and 7 = 4 in the case 
of to = 1. We can approximate the deviation from the 
infinite network limit by 

/>oo 

(fcnn(fc) - k D n{k,t)]kp k - / dl I (tiki + Tljk), (28) 

-' fccut 

where fc cu t = k cut (t) is the time dependent cutoff of the 
degree distribution. In these networks, k cut (t) ~ t 1 /^ 7-1 ). 
Substituting Eq. ^27J) into Eq. (28|) results in 

^nn(^) &nn(&i^) 

~ C^-H^y^-V + C2 fc 2 7-3 < -(2 7 -3)/( 7 -l) (2Q) 
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at large degrees k < k cut for the networks with 7 > 3. 
Here C\ and C2 are constants depending on 7, which we 
do not calculate. 

Similarly, for 2 < 7 < 3, where k nn (k,t) diverges as 
t — » 00 at any m, we can write an asymptotic estimate 



k nn (k,t)kp k 



dl l(n ki +ni k ). 



(30) 



Substituting Eq. ([57} into Eq. ^ we obtain 



k nn (k,t) ~ cgfcT-^^/CT- 1 ) + c 4 ^- 3 t (3 - 2 ^/^- 1 ) 

(31) 

at large degrees fc < fc cu t for the networks with 2 < 7 < 3. 
Here C3 and C4 are some constants [Tn |. 

We substitute asymptotics ([2T)[) and (J2J) into expres- 
sion ([25)) for the Pearson coefficient and take into account 
the leading terms in the numerator and the denominator. 
The combinations of these leading terms are different in 
different areas of 7 and m, since the quantities k nn (k,t), 
(k 2 ), and (fc 3 ) in expression (|2"5|) change their asymptotic 
behavior at two special points, namely, 7 = 3 and 7 = 4. 
In particular, at these points (7 = 3 and 4) the second 
and, respectively, the third moments of the degree distri- 
bution become divergent. For example, if 2 < 7 < 3 at 
any m, relation (|25p with substituted asymptotics 
takes the form: 



r(i) 



7)/(7-l) 



■1/(7-1) 



dk fc 2 /c- 3+7 fc-T 



2 7 )/(7-l) 



t l/(T-l) 

dkk 2 k 2 ~>- 3 



k~ 




t i/(T-D \ 

2m / dfcfc 3 " 7 



(32) 



where 
result: 

r(t) 



and c'q are constants. This leads to the 



[c'/lni f(3-7)/(7"l) + c ^(3-7)/(7-l)] 



t (4-7)/( 7 -l) 



-(7-2)/(7-l) 



c «£2(3-7)/(7-l) 



(33) 



where c", c 2 ' and c 3 ' are constants. In a similar way, we 
derive the other asymptotics listed in Eqs. (f5]) and ©. 

Interestingly, both the terms in Eqs. (f29[) and (13 lj) give 
contributions of the same order of magnitude to r(t), see 
the numerator of Eq. (|33|) . Note that we suppose that 
for m > 1 the form of the asymptotics of k n n(k,t) is the 
same as in Eqs. ([2T)[) and (|3 1 1) . To verify our assumption, 
we inspected the corresponding results for k nn (k, t) in 
Ref. [15| and found that the asymptotic behavior should 
be similar at different m (apart of numerical coefficients) 
if exponent 7 is fixed. In addition, we checked that re- 
sults © also can be derived by using the asymptotics of 



k n n(k,t) from Ref. [la] . This confirms our conclusions 
about the asymptotics of r(t). 



IV. DISCUSSION AND CONCLUSIONS 

Our result, namely zero Pearson coefficient of random 
recursive trees at any 7 > 2, naturally leads to the fol- 
lowing questions. What is the class of trees that have 
zero Pearson coefficient? Is Pearson's coefficient zero for 
any infinite recursive tree? At present, we cannot answer 
the first question. As for the second question, the answer 
is negative. Indeed, as a counter-example, we present the 
simplest infinite recursive tree with a non-zero Pearson 
coefficient. This is a star, which is a tree of t > 2 vertices, 
including t — 1 leafs and the hub of degree t — 1. For star 
of any size, clearly, r = — 1. 

Thus we have studied the Pearson coefficient in 
strongly correlated growing networks. They form a rep- 
resentative class of networks with strong structural cor- 
relations including pairwise correlations between degrees 
of the nearest-neighbor vertices. Despite these correla- 
tions, we have found that in a wide range of infinite cor- 
related networks the Pearson coefficient approaches zero. 
For any infinite random recursive tree whose growth is 
driven by arbitrary linear preferential attachment of new 
vertices, we observed zero Pearson coefficient. These net- 
works include random recursive trees with rapidly decay- 
ing and even exponential degree distribution, where the 
third moment of the degree distribution is finite. So here 
zero value of Pearson's coefficient demands zero correla- 
tion function in the numerator of definition (fT]) . This sur- 
prising equality to zero indicates an exact mutual com- 
pensation of assortative and disassortative degree-degree 
correlations. In this respect, the recursive trees is a very 
special case of random recursive networks [l8j . 

We have investigated the size dependence of Pear- 
son's coefficient in the growing networks. We have found 
that the size effect is significant even for very large net- 
works. We have shown that a growing network during 
its evolution may demonstrate essential and even non- 
monotonous variation of Pearson's coefficient. Due to 
this marked size dependence, it is hardly feasible to use 
this integral characteristic of correlations for quantitative 
comparison of different real-world networks. Instead, for 
this purpose, one has to use more informative character- 
istics, for example k na (k). 
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